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Outage probability and capacity of a class of block-fading MIMO channels are considered with 
partial channel distribution information. Specifically, the channel or its distribution are not known but 
c/2 ' the latter is known to belong to a class of distributions where each member is within a certain distance 

o : 

(uncertainty) from a nominal distribution. Relative entropy is used as a measure of distance between 

distributions. Compound outage probability defined as min (over the transmit signal distribution) -max 

> • 

C*") ■ (over the channel distribution class) outage probability is introduced and investigated. This generalizes 

O ■ 

the standard outage probability to the case of partial channel distribution information. Compound outage 
probability characterization (via one-dimensional convex optimization), its properties and approximations 
are given. It is shown to have two-regime behavior: when the nominal outage probability decreases (e.g. 
by increasing the SNR), the compound outage first decreases linearly down to a certain threshold (related 
to relative entropy distance) and then only logarithmically (i.e. very slowly), so that no significant further 
decrease is possible. The compound outage depends on the relative entropy distance and the nominal 
outage only, all other details (nominal fading and noise distributions) being irrelevant. The transmit 
signal distribution optimized for the nominal channel distribution is shown to be also optimal for the 
whole class of distributions. The effect of swapping the distributions in relative entropy is investigated 
and an error floor effect is established. The compound outage probability under L p distance constraint 
is also investigated. The obtained results hold for a generic channel model (arbitrary nominal fading 
and noise distributions). 
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Index Terms 

- compound MIMO channel, outage probability/capacity, channel distribution uncertainty, relative 
entropy distance 

I. Introduction 

Multiple-input multiple-output (MIMO) wireless systems have received significant attention 
due to the promise of high spectral efficiency flDED, which has been extensively investigated. 
As with any wireless system, their channel capacity depends significantly on the channel state 
information available at the transmitter and the receiver as well as the fading statistics experienced 
by the channel [3J. When the fading process is egrodic (i.e. the channel "reveals" its statistics 
to a single codeword), an appropriate performance indicator is ergodic capacity (DIED. On the 
other hand, when the channel is block-fading (or quasi-static), i.e. stays fixed during a codeword 
transmission and changes from codeword to codeword, its Shannon capacity is zero in many 
cases of practical interest (i.e. Rayleigh fading) so that outage capacity (capacity versus outage) 
and outage probability (for given target rate) are appropriate performance indicators [ID -El- In 
the block-fading (quasi- static) regime, the channel capacity is not affected by the receiver channel 
state information U but depends significantly on the channel state information available at the 
transmitter [3j[9][12j. Since the channel state information is obtained via channel measurements, 
its accuracy may be limited due to variability and difficult propagation conditions (e.g. low SNR) 
in a wireless channel. The channel state information at the transmitter is further limited due to 
limitations of the feedback channel (if any). This situation can be modeled via a compound 
channel model, where the true channel is not known but it is known to belong to a certain 
(limited) class of channels and the corresponding compound channel capacity theorems have 
been established flSJ- [[8]| - While these theorems treat all channels in the class equally and build a 
code that performs well on any such channel, the corresponding capacity is typically limited by 
the worst channel in the class and may be low, even though most channels in class are good and 
the worst channel is realized with low probability, i.e. it is a conservative performance indicator. 
To avoid this problem, a concept of composite channel has been introduced [8] [19], where each 

'since the receiver can always learn the channel via a training sequence, which results in asymptotically-negligible loss in the 
capacity in the quasi-static mode |3|, full CSI at the receiver can be assumed that significantly simplifies the analysis. 
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channel in a class has associated probability measure, so that bad low-probability channels do 
not penalize significantly the performance metric. The corresponding channel capacity theorems 
can be proved via the concept of information density f[T7llfT9l or using the compound channel 
approach 01lfi2l. 

Another possibility to model the uncertainty of channel state information is to assume that 
the transmitter knows only the channel distribution but not the channel itself. A number of 
results on MEMO channel capacity have been obtained under this assumption [|9||-[fT2|. A com- 
prehensive review of the impact of channel uncertainty on its performance and corresponding 
coding/decoding strategies can be found in [8]. A concise review of more recent activities on 
MIMO channels is available in |fl3ll . The compound MIMO channel capacities under the trace 
and spectral norm constraints have been studied recently in lfT3l - [fT5l . A construction of a code 
approaching the compound channel capacity can be found in |fl6l . 

In this paper, we consider a situation where even the channel distribution information is not 
available at the transmitter; rather, the transmitter knows that the channel distribution belongs to 
a certain class centered around a nominal distribution. This models a practical scenario where 
the channel distribution information is obtained from multiple but limited measurements, so 
that the true distribution is known only with finite accuracy (typically related to the number of 
independent samples used for estimation). This also models a dynamic scenario where the channel 
distribution information obtained from past measurements may be outdated. The uncertainty 
in the channel distribution information may also be related to the limitation of the feedback 
channel used to supply this information to the transmitter. We assume a quasi-static (block- 
fading) scenario so that channel state information at the receiver is irrelevant. Our channel 
model is quite generic: we do not assume any particular nominal channel distribution and even 
the channel noise can be arbitrary (except for examples, where particular distribution and noise 
are considered), so that the results are general too. Relative entropy between two distributions is 
used as a measure of distance, so that the distribution uncertainty class includes all distributions 
within certain relative entropy distance of the nominal one. Similar approach was adopted in lETl 
to study the ergodic capacity under channel distribution uncertainty El and in [|22l to investigate an 

2 while the impact of channel distribution on the capacity is quite mild in the ergodic regime (due to averaging over the 
channel statistics), it is much stronger in the non-ergodic regime (no averaging) |4]. 
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optimal control of stochastic uncertain systems. A justification of relative entropy as a measure 
of distance between distributions can be found in e.g. [|28lll29ll . Our results on compound outage 
probability provide further justification, as they indicate that the relative entropy distance limits 
the achievable outage probability (capacity) via the error floor effect. When the nominal outage 
probability is negligible and the distance is small, the compound outage probability equals to 
the relative entropy distance (regardless of all other details). 

Since the channel is block-fading, the outage probability and capacity are considered as main 
performance metrics, which we term "compound outage probability /capacity" to emphasize that it 
applies to a class of fading distributions (i.e. "compound distribution") rather than any particular 
one. This parallels the concept of compound channel, where a code is designed to operate on 
any member in the class. In our case, a code is designed to operate for any channel distribution 
in the class, so that the compound outage probability involves maximization over all feasible 
channel distributions and minimization over the transmitted signal distribution (subject to the 
power constraint), and the corresponding compound outage capacity is derived from it. We also 
consider a scenario where the transmitted signal distribution is fixed a priori (e.g. universal code 
design). 

The system/channel model and the performance metrics (outage probability and capacity) are 
introduced in Section HO Compound outage probability is defined and investigated in Section 
Hill which includes its closed-form characterization in Theorem \T\ (as one-dimensional convex 
optimization problem) and the worst channel distribution (which is a piece-wise constant scaling 
of the nominal distribution). Remarkably, the compound outage probability depends only on 
the nominal one and the relative entropy distance, all other details (e.g. nominal fading and 
noise distributions) being irrelevant. Properties of the compound outage probability are given 
in Propositions [TJ|3l and its two-regime asymptotic behavior is identified in Section IIII-BI 
Specifically, as the nominal outage probability decreases (say by increasing the SNR), the 
compound outage probability first decreases linearly too, but after a certain threshold (equal to 
about the relative entropy distance when the latter is small), it decreases only logarithmically, i.e. 
very slowly, so that significant decrease is not possible anymore. Optimizing the transmit signal 
distribution in this regime does not bring in significant improvement either so that any reasonable 
distribution (e.g. isotropic signalling) will do as well. Compact, closed-from approximations are 
obtained for the compound outage probability in these two regimes using the tools of asymptotic 
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analysis. Theorem |2] shows that the transmit signal distribution optimal for the nominal channel 
distribution is also optimal for the whole class, so that known optimal transmit covariance 
matrices (see e.g. BUS- [ID]) can be "recycled". 

Since relative entropy is not symmetric, Section JV] investigates the impact of this asymmetry 
on the outage probability. Swapping the distributions (nominal and true) is shown to result in the 
error floor effect: the compound outage probability is bounded away from zero, does not matter 
how low the nominal outage (or how high the SNR) is. The error floor depends on the relative 
entropy distance: it decreases with it and when it is small, they are equal, so that the relative 
entropy distance also serves as the compound outage probability in this regime. Theorem |3] 
provides a closed-from expression for the compound outage probability that depends on a unique 
solution of a single non-linear equation. An alternative characterization is via one-dimensional 
convex optimization. The worst-case channel distribution and properties of the compound outage 
probability are also given, including its two-regime behavior and compact approximations via 
asymptotic analysis. 

Based on the results above, compound outage capacity is studied in Section |V] In Section IVH 
the class of distributions is considered where the distance is defined via L p norm (a particular 
case of p = 2 corresponds to popular mean- square-error estimation) and its outage probability 
is related to the results above. In particular, the compound outage with relative entropy distance 
serves as a lower bound for this case and the error floor effect is present as well. Section IVIII 
concludes the paper. Proofs are given in the Appendix. 

II. System Model, Outage Probability and Capacity 

Let us consider a generic discrete-time baseband multiple-input multiple-output channel as 
shown in Fig. [T] where x and y are the input (transmitted) and output (received) vectors (or 
sequences), and H denotes channel state. In the general case, the channel is described by the 
conditional probability distribution of y given x and H, W (y|x, H), and the mutual information 
(per channel use) supported by the channel for a given distribution of x and channel state H is 
7(x;y|H). We assume that the channel is block-fading (non-ergodic), i.e. a particular channel 
realization H is selected in the beginning and stays fixed for the whole duration of codeword 
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transmission; next codeword will see a different channel realization Q Channel fading distribution 
is described by its probability density function /(H). Most of our results will hold in this 
generic scenario, which includes as special cases frequency-selective (inter-symbol interference) 
or frequency-flat (no ISI) Gaussian MIMO channels^. 



X , Channel: y , 
7 noise & fading y 

W(y|x,H) 



Fig. 1. Generic discrete-time basedband MIMO block-fading channel model. No assumptions on noise and fading distributions 
are made. 



We will not assume any particular fading and noise distribution (except for examples) so that 
our results are general and apply to any such distribution. The transmitted signal, receiver noise 
and the channel are assumed to be independent of each other. We also assume that the transmitter 
does not known the channel but only has a partial knowledge of its distribution (as explained 
later on); channel knowledge at the receiver is irrelevant in the block-fading (i.e. quasi-static) 
environment. 

In the special case of Gaussian MIMO channel, the channel model becomes 

y = Hx + £ (l) 

where £ is the additive white complex circularly-symmetric Gaussian noise and H is the chan- 
nel matrix, whose (i,j)-th entry is the channel gain between i-th output and j-th input (e.g 
receive/transmit antennas, time or frequency slots etc., see e.g. [fT2l for more details). As is well- 
known, when the noise is Gaussian, the optimal signaling is also Gaussian (see e.g. [[TJ) so that 
complex circularly- symmetric Gaussian x is optimal. Its distribution is completely characterized 
by its co variance matrix R x = E[xx^], where t denotes Hermitian conjugation. For a given 
channel realization H, a given transmit covariance R x achieves the celebrated log-det mutual 

3 With a slight modification in notations, this block-fading model can also be extended to the case where each codeword sees 
a finite number of channel realizations, e.g. as in 

(12(111, and our results will hold in that case as well. 
4 and can also be extended to include the finite block-length regime considered in 1201 . 
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information in this Gaussian MIMO channel |fTflll2ll, 

J(x;y|H) = hi|I + 7 HR x Ht| (2) 

where | • | denotes determinant, I is the identity matrix, 7 is the average SNR per Rx antenna. 

Main performance metrics in the block-fading regime are outage probability and outage 
capacity Oil Il4l |[T2l 0. Outage probability is the probability that the channel is not able to support 
the target rate R. When the transmitter knows the channel distribution (but not the channel itself), 
the outage probability is 

Pout{R)= min Pr{/(x;y|H) < R} (3) 

trR x <PT 

where Pr{J(x;y|H) < R} is the outage probability for a given transmit covariance and the 
minimization is over all possible transmit covariance matrices satisfying the power constraint, 
P T is the total transmit poweilj. Outage capacity is defined as the largest possible rate such that 
the outage probability does not exceed the target value 8, 

C s = maxji? : P out {R) < 5} (4) 

Clearly, P ou t(Cos) = S. Finally, one may also consider the outage probability and capacity for a 
given (fixed) transmit covariance. Achievability of the outage capacity/probability follows from 
the compound channel capacity theorem fl6l-ll8l lfT2l (which guarantees an existence of a code 
that works on every channel in the no-outage set); see also [fr71llfT9ll for a modern treatment 
using the concept of information density. 

III. Compound Outage Probability for a Class of Channels 

Consider the scenario where the transmitter has only partial channel distribution information. 
Namely, it knows that the channel probability density function (PDF) /(H) is within a certain 
distance of the nominal distribution /o(H). We use the relative entropy as a measure of the 

It can be further shown that the outage probability is the best achievable average codeword error probability |HT|-[[T9l. 
6 In the general case, the minimization in lf3} is over all possible distributions of x subject to the power constraint. To simplify 
notations, we will use below the notations of (f3}, which apply to the Gaussian MIMO channel in Q}, with understanding that 
the same results hold for the general MIMO channel as well. 
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distance between two distributions, so that all feasible distributions / satisfy the following 
inequality: 

D(f\\fo)= [ f\n^dii<d (5) 
J Jo 

where D(f\\f ) is the relative entropy or Kullback-Leibler distance between the distributions, 
and d is the maximum possible distance in the uncertainty set to which / belongs. Throughout 
the paper we assume that d < oo. In this scenario, the definition in (Q does not apply (since 
the true distribution / is not known) but can be generalized to 

P* = min max Pr{/(x; y|H) < R} (6) 

trR x <P T D(/||/o)<d 

and the outage capacity can be defined as in © with the substitution P out — >■ P* ut . Its achievability 
also follows from the compound channel capacity theorem [7|[8llfl2ll or from lfl7llfT9l , since the 
optimal signalling does not depend on the true channel distribution, but only on the nominal one 
(and also the relative entropy distance d). This problem setup models a practical situation where 
the channel distribution information is obtained from measurements or physical modeling, which 
are never perfect. It also accounts for the fact that the estimated channel distribution may change 
with time in dynamic scenarios. We term P* ut in © "compound outage probability" since it is 
a performance measure of a class of channel distributions rather than a single distribution. This 
approach parallels the work on compound channel capacity [l5l-[l8l [fT3l - lfT5l where the channel 
is not known to the transmitter but it is known to belong to a certain class. 

To characterize the compound outage probability P* ut , we adopt a two-step approach: first, 
we characterize the outage probability for a given Tx covariance matrix (i.e. no minimization 
in ©, which also represent a practical situation where the transmit covariance is set a priory); 
then it is minimized over all feasible transmit covariances. 

A. Step 1 : Compound Outage for a Given Tx Covariance 

When the Tx covariance R x is giverJz , the compound outage probability is 

P oui (R x )= max Pr{/(x;y|H) < R} (7) 

D(f\\fo)<d 

Its characterization is strikingly simple in the generic scenario, i.e. for any noise and nominal 
fading distribution. 

7 In the general case, this corresponds to a given distribution of x. 
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Theorem 1: For a given Tx covariance R x and arbitrary nominal fading distribution / , the 
outage probability in © can be expressed as 

P 0Ut (Rx) = min[s ln(l + {e l ' s - l)e) + sd] (8) 

s>0 

where 

e= / /o^H (9) 

/(x;y|H)<i? 

is the nominal outage probability (i.e. the outage probability under the nominal channel distri- 
bution). The worst channel distribution /* (the maximizer in ©) is given by 

_ (eV - l)l(H) + 1 
/ " (e i/.._i )e + i ^ (10) 

where s* is the minimizing s in ©, and £(H) is the indicator of the outage set: ^(H) = 1 if 
J(x;y|H) < R and otherwise. 

Proof: see Appendix. ■ 

Note that Theorem Q] effectively reduces the infinite-dimensional optimization problem in © 
(the optimization there is over the set of all admissible distributions /) to one-dimensional 
convex optimization in ®, which can be effectively solved using numerical algorithms. This is 
accomplished using Lagrange duality theory (see Appendix for details). As we will see below, 
this is not the only advantage: © also provides a number of insights unavailable from ©. It is 
remarkable that the nominal outage distribution enters the compound outage probability in © 
only via the nominal outage probability e, all other its details being irrelevant, i.e. two different 
nominal distributions with the same nominal outage probability will produce the same compound 
outage probability. 

Note that the maximizing density /* in (flOl) mimics the nominal one f in a piece-wise 
constant manner: 

where O = {H : £(H) = 1} is the outage set, so that the right-hand side of (fTTT ) is independent 
of H in each set and /* is a scaled up version of f in the outage set and scaled down otherwise 
- see Figj2j 
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Fig. 2. The worst channel distribution over the nominal one. Note that the worst distribution is a piece-wise constant scaling 
of the nominal one. 



An additional advantage of © is that the resulting optimization problem there is convex, i.e. 
the function 

L(s) = s ln(l + (e 1/s - l)e) + sd (12) 

is convex in s > (see Appendix) so that P out (R x ) = min s > L(s) in © can be solved 
efficiently using any known numerical algorithm (the solution is unique and satisfies dL(s)/ds = 
0). Alternatively, the tools of asymptotic analysis (see e.g. [|25l[|26l ) can be used to obtain 
approximations (see Section IIII-BI) . This dual representation also provides a number of insights, 
as indicated below. 

Let us now consider the outage probability in dE) as a function P ou t(d) of the distance d. A 
number of its properties follow. 

Proposition 1: For a given covariance R x , the compound outage probability P ou t(d) as a 
function of distance d has the following properties: 

1) Pout(d) is concave in d > 0. 

2) P ou t(d = 0) = e, i.e. the compound outage probability equals the nominal one when d — 0. 

3) P out (d) is a non-decreasing function of d, that is 

Pout{di) < PouM, < di < d 2 < oo. (13) 
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and the equality holds if and only if P out {di) = P ou t{d2) = 1,0 i.e. P out {d) is a strictly increasing 
function of the distance d unless P out = 1,0. 

Proof: see Appendix. ■ 

Proposition 2: The compound outage probability P out has the following properties: 

1) Pout = 1 if and only if e — 1. 

2) Pout = if and only if e — 0. 

3) Pout > £, and the equality holds if and only if d = or e = 0, 1. 

Proof: see Appendix. ■ 

While in general the compound and nominal outage probabilities can be very different (as 
property 3 above shows), the compound outage takes on a limiting value (either or 1) if and 
only if the nominal outage does so. 

Proposition 3: The compound outage probability in © is a strictly-increasing, concave func- 
tion of the nominal outage e, i.e. 

Pautfa) < Pout(e 2 ), 0<e 1 <e 2 <l. (14) 

with the boundary conditions P ou t(s = 0) = 0, Pout(e = 1) = 1. 

Proof: see Appendix. ■ 

Proposition 4: The compound outage probability P out in Theorem Q] can be bounded as 
follows: 

e<Pout <min[d+(e-l)e,l]. (15) 

Proof: see Appendix. ■ 



B. Asymptotic Regimes 

We now consider the compound outage in © in two limiting regimes: 

1) The uncertainty-dominated regime e — > and fixed d, i.e. the dominant source of outage 
events is from significant deviation of the true channel distribution from the nominal one (outage 
events under the nominal distribution can be neglected). 

2) The nominal-outage dominated regime d — > and fixed e, i.e. when the impact of channel 
distribution uncertainty is negligible as outage events under the nominal distribution dominate 
the performance. 
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Proposition 5: The outage probability P out in © in the low nominal outage regime, s — > 
and fixed d > 0, is as follows: 

+ (16) 



In - - In In 

and the optimal (minimizing) s* in ([8]) is given by 

1 



ln^ -lnln^ 

e e 



(1 + 0(1)). 



This is the the uncertainty-dominated regime (the main contribution to P out is coming from d 
rather than e). 

Proof: see Appendix. ■ 

Further analysis shows that the approximations above (without o(l) term) are accurate provided 
that e < d < 1. Note from (fT6l) that the main contribution to P out is coming from d (i.e. the 
uncertainty) rather than e (i.e. the nominal outage) since hx{d/e) is a slowly-varying function 
of e, so that variations from the nominal channel distribution dominate the outage events. Also 
note that the relative entropy distance d is directly related to the compound outage probability, 
which indicates that it is this distance that should be used as a measure of accuracy in estimating 
the channel distribution from measurements or physical modeling since it is directly related to 
the system performance (outage probability and capacity). 

Let us now consider the nominal outage-dominated regime (i.e. fixed e and d — >■ 0). 

Proposition 6: In the low channel distribution uncertainty regime, d — > and fixed e, the 
compound outage probability is 

Pout = e+ yj2d{\ - e)e + o(Vd) (17) 

and the optimal s* is given by 

Proof: see Appendix. ■ 

Further analysis shows that the approximation above is accurate when d <^ e < 1 and that the 
impact of uncertainty is negligible, P out ~ e, when yd y/e. Comparing Proposition |5] and 
Proposition |6l one concludes that indeed there are two regimes in the behavior of P out (e): 
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1) The uncertainty dominated regime (nominal outage is neglegible), when e < d < 1 so 
that 

d d 
In | -In In | ln(l/e) 
where ~ means "scales as", so that P out depends linearly on d but only logarithmically 

(i.e. very slowly) on e. 

2) The nominal outage-dominated regime (uncertainty is negligible), when d <C e < 1 and 



P out y/2d(l - eje ~ e (19) 

i.e. d contributes very little to the outage probability. 
These two regimes immediately suggest some design guidelines related to the outage prob- 
ability. In the uncertainty dominated regime, the main way to reduce outage probability is via 
decreasing the uncertainty of the channel distribution, e.g. via improved channel measurements 
or modeling; reducing the nominal outage probability is not efficient here, so that minimizing it 
via the optimal transmit covariance (or distribution in the general case) is not worth the effort 
- any reasonable covariance (e.g. isotropic signalling) will do as well. This approach, however, 
will bring little improvement in the nominal outage-dominated regime, where the only way to 
reduce the outage probability is via improving systems performance under the nominal fading, 
e.g. by increasing the SNR or optimizing the transmit covariance. Note that these conclusions 
hold for any nominal channel distribution (e.g. not limited to i.i.d. Rayleigh) and for any noise 
(not only Gaussian). 

As an example, consider a 1 x 1 Rayleigh-fading channel with Gaussian noise, in which case 
e ~ I/7, when d <C 1 and 7 3> 1, so that 

p n J 7 in re S ime 1 ^ ^ l / d ^ rom 

Pouth) ~ < ; (20) 

[ in regime 2 (7 > 1/d) 
i.e. the outage probability scales with SNR 7 as 1/SNR in regime 1 but only as 1/ In SNR in 
regime 2, i.e. increasing the SNR is only efficient in the former case. When del, increasing 
the SNR will first decrease P ou t, but only down to about d and after that point the decrease 
becomes logarithmically slow. From a practical perspective, it means that P out cannot be reduced 
significantly beyond d by increasing the SNR. This observation indicates that the relative entropy 
distance d is indeed an appropriate measure of channel distribution uncertainty in the non-ergodic 
(block-fading) mode. Fig. [3] illustrates the two-regime behavior of P ou t(l)- 
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This two-regime behavior can also be linked to the way channel distribution is obtained from 
measurements: a finite number of fading channel realizations are measured and the empirical 
channel distribution is derived based on it. However, the relative accuracy of this empirical dis- 
tribution is always lower at the distribution tails, where fewer measurement points are available. 
On the other hand, when the average SNR is high, as in regime 2, an outage event takes place 
when the channel is very weak, i.e. at the distribution tail, so that the inaccuracy in the channel 
distribution estimation plays a dominant role there. Ultimately, low compound outage probability 
can only be achieved by insuring sufficiently high accuracy of the estimated distribution tail 
(small d), i.e. when a sufficient number of independent measurements fall into that region. 




10 20 30 40 50 60 70 80 90 100 
SNR, dB 



Fig. 3. Two-regime behavior of the compound outage probability. Its approximations in d!8t and d 1 9b and nominal outage 
e = 1/SNR (set for convenience) are also shown; d = 10~ 3 . Note that both approximations are accurate in their respective 
regimes. Decreasing the compound outage probability beyond about 10 _3 (~ d) requires exponentially high SNR and is not 
practical (it takes 60 dB extra to go from 10~ 3 to 10 -4 , while normally, i.e. without uncertainty, it would take only 10 dB). 

C. Step 2: Minimizing over the Tx Covariance 

Using Theorem \T\ we are now in a position to characterize the compound outage probability 
in ©• 

Theorem 2: Consider a class of fading channels in ©. Its compound outage probability in 
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© can be found from 



P* out = mm[s ln(l + (e 1 ^ - 1) £ *) + sd] (21) 

s>0 



where e* = min tr .R, x <p T e(R x ) is the optimized nominal outage probability, so that the outage- 
minimizing covariance for the class of channel distributions in © and for the nominal distribution 
/o are the same, 



Proof: 



arg min P out (K x ) = arg min e(R x ) (22) 

trR x <P T trR x <P T 



P* ut = min PoutCR* 

trR x <P T 



= min minis ln(l + (e 1 ^ — l)e) + sd] 

trR x <P T s>0 

= min[s ln(l + (e 1/s - l)e*) + sd] 



s>0 

where (a) follows from dH) and (b) follows from the fact that min irRx <p T and min s > can be 
swaped and ln(-) is a monotonic function, so that the minimization of the compound outage 
over R x is equivalent to the minimization of the nominal outage. ■ 

A significance of Theorem |2] is that the infinite-dimensional optimization in © is reduced to a 
one-dimensional convex optimization in (|2"Tj) and, furthermore, the optimal Tx covariance is the 
same as for the nominal channel, so that a significant number of known results lfT1ll9l-|[TT |[12 l 
apply directly to the compound fading channel as well, i.e. no new search of optimal covariances 
is required. 

When the compound outage P* ut in (12TI) is considered as a function of the distance d, P ( ^ tt (d), 
its properties mimic those in Proposition [T] with the substitution £—)■£*. Also, the results and 
conclusions of Section IIII-BI hold under this substitution. In particular, optimizing the transmit 
covariance (distribution) is worth the effort only in the nominal outage dominated regime. 

IV. Compound Outage Probability for the D(f Q \\f) < d Class 
Since relative entropy is not symmetric, i.e. D(/||/ ) ^ D(f \\f), we consider in this section 



the constraint 



D(fo\\f) = J fo\n^jdU<d (23) 
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to see the impact of the order on the obtained results. One property of the compound outage 
probability P out in © for this class of distributions is immediate. 

Proposition 7: For a given Tx covariance matrix, the compound outage in (|7) under the 
distribution class in (|23l ) is bounded as follows: 

Pout >l-e~ d + e- d e (24) 
> 1 - e- d (25) 

and the bounds are achievable. When d <C 1, 

Pout >d + e (26) 

Proof: (l24l) follows from the fact that the lower bound is achievable by 

/(H)=p*(H) + (l-p)/ (H) 

where p = 1 — e~ d and £(•) is the Dirac delta function, i.e. by the distribution that mimics 
the nominal one except for placing mass p at zero (where the mutual information is zero). The 
bound in (1251) is trivial and is achievable when e d — 1 3> e. (l26l) follows from e~ d rs 1 — d for 
d < 1. ■ 

An important conclusion is immediate from (|25l : P ou t{s — 0) > 1 — e~ d , i.e. there is a 
saturation (or error floor) effect in the behavior of P ut{^)'- even though e — V (e.g. by SNR — » 
oo), P out ^> 0. From (I26T ). P out > d when d 1 in this regime, i.e. cannot be made smaller than 
the relative entropy distance d, does not matter how large the SNR (or how small the nominal 
outage) is. This is in contrast to (fT6l) . where P out — > when e — > 0, even though logarithmically 
slowly (i.e. no saturation). The absence of saturation in the latter case should not however 
be overestimated, since the convergence P out — > is logarithmically slow in e, i.e. requires 
exponentially large SNR, so that for all practical purposes, P out also saturates around d, as was 
indicated in Section IIII-Bl Note also that ([26b places d and e on equal footing, re-enforcing our 
earlier conclusion that d is an adequate measure of fading uncertainty in the non-ergodic regime. 

Let us now characterize precisely the compound outage probability for the distribution class 
in d23j. 
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Theorem 3: Consider the compound outage probability for the distribution class in (|23l and 
a given Tx co variance R x , 

P OMt (R x )= max Pr{/(x;y|H) < R} (27) 

D{fo\\f)<d 

It can be expressed in the following form: 

Pontic) = J—[ £ ( 28 ) 

where fi > 1 is a unique solution of 

ML^tL = e -t (29) 
fjL — l + e 

and A* = e~ d (fi — l) e /i 1_£ . The maximizing density /* in (|27l) is given by 



r = A *f° v (30) 



An alternative characterization of P n „+(R X ) is 



Po«t(R x ) = min 

A>0 



X: ( ^— + 1x1-^-^1 +\(d-\n fI 



where 



\i — 1 — V A 

= ~ (l + A + v/(l-A) 2 + 4Ae) 



(3D 



(32) 



Proof: see Appendix. ■ 
Note that Theorem |3] reduces the infinite-dimensional optimization problem in (1271 ) to the 
closed-from solution in (|28T) that depends on a unique solution of a single monotonic non- 
linear equation in (|29l) (can be efficiently found using any of known numerical techniques). The 
alternative representation in (|3H is a one-dimensional convex optimization problem. It follows 
from (|30l) that the maximizing density /* mimics the nominal one f , albeit with different 
constants in the outage and no outage sets, 

f -^rf Q , if H G O 

f* = { (33) 
( f/o ifH^O 

where O = {H : £(H) = 1} is the outage set, so that /* is a scaled up version of f in the 
outage set and scaled down otherwise! (to keep the normalization fixed), which is what one 
would intuitively expect to maximize the outage probability. Fig. |4] illustrates this behavior. 

8 Note that A* < fj, < 1 + A*, with 1st equality iff e = and 2nd one iff e = 1. 
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fo 




Y 

outage set: 



V 

no outage set: 
H£0 



Fig. 4. The worst channel distribution over the nominal one. Note that the worst one is a piece-wise constant scaled version 
of the nominal one, which follows the same tendency as for the other uncertainty set. 



Let us now consider asymptotic regimes. 

Proposition 8: The compound outage probability P ou t(R^) in Theorem |3] behaves in the low 
nominal outage regime e — >• and fixed d as follows: 

P ou t = l-e- d + o(l). (34) 

Proof: see Appendix. ■ 

Note that (O implies that P ou t{.£ = 0) = 1 - e~~ d , i.e. the lower bound in (|24j is tight. It 
also re-affirms the saturation (error floor) effect discussed above. It is also interesting to note 
that the error floor 1 — e~ d depends on the distance d only, all other details (e.g. the nominal 
distribution / ) being irrelevant. Further analysis shows that the approximation in (|34b is accurate, 
i.e. P out «1- e~ d , provided that £<e <i -l. This is also consistent with the lower bound in 
(l24b (3rd term is negligible under the latter condition). 

Proposition 9: The compound outage probability P ou t(R x ) in Theorem |3] behaves in the small 
uncertainty regime d — > and fixed e as follows: 

P ou4 = e+ y/2e(l-e)d + o (y/c?) (35) 
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Proof: see Appendix. ■ 

It is clear from (1331 ) that P out (d — 0) = e, so that the lower bound in (1241 ) is tight in this case. 
It is also obviously tight when e — 1 and, as we seen in Proposition [8l in the e = case. In 
fact, numerical analysis shows that this lower bound is a good approximation of P out over the 
whole range of e - see Fig. |5] Comparing (1351) to (flTT) . we conclude that the compound outage 
probability is the same for the D(f\\f ) < d and D(f \\f) < d uncertainty sets in the low 
uncertainty regime (i.e. the asymmetry of relative entropy does not have any effect here) while 
the same cannot be said about the low nominal outage regime (compare (l34t to (fToTl). 

It can be shown (via numerical analysis - see Fig. [5]) that the approximation in (1351) is accurate, 
i.e. 



P out y/2e(l-e)d (36) 

when d < e and that the effect of uncertainty is negligible, i.e. P out ss e, when d «C e, which 
parallels the conclusions made in Section IHI-Bl 

Fig. [5] illustrates the two-regime behavior of the compound outage probability and validates 
the approximation above. It also demonstrates that the lower bound in (l24l) is quite tight over 
the whole SNR (nominal outage) range and thus can be used instead of the true compound 
outage probability for design/analysis purposes to estimate the impact of channel distribution 
uncertainty. 

We would like to emphasize that the results above hold for an arbitrary nominal channel 
distribution f (e.g. Rayleigh, Rician, Nakagami, Log-Normal, correlated and/or non-identically 
distributed, etc.) - it enters into the compound outage probability only via the nominal outage 
e, and also for arbitrary noise. 

When the compound outage probability is minimized over transmit co variance (distribution) 

P* = min max Pr{/(x; y|H) < R] (37) 

some of the results above (namely, (|24l )-(l26l). (|34T ). and (|35l) . (l36l) when e < 1/2) apply directly 
via the substitution £—>■£*, i.e. the optimal transmit covariance is the same as for the nominal 
outage. Furthermore, while optimizing the covariance to reduce the nominal outage also reduces 
the compound outage probability in the low uncertainty regime (i.e. in (1351)). it has negligible 
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Fig. 5. Two-regime behavior of the compound outage probability and its lower bound in l !24t . the approximation in ( 136) . and 
the nominal outage e = 1/SNR; d — 10 -3 . Note the saturation (error floor) effect: the compound outage cannot be reduced 
below 10 (= d), in agreement with d25t . does not matter how high the SNR (or how low the nominal outage) is. 



impact in the uncertainty-dominated regime in (|34l ) so that any reasonable covariance (e.g. 
isotropic signalling) will work equally well (provided that e* < e rf - 1). 

V. Compound Outage Capacity 

Let us now consider the compound outage capacity, which extends the definition of outage 
capacity in © to a class of distributions. Consider first the case when the transmit covariance 
is fixed: 

C 5 = max{R : max Pr{/(x; y |H) < R} < 5} (38) 

where / belongs to a distribution uncertainty class, D(f\\f ) < d or D(f \\f) < d. 

It is clear that C§ < Cos in general, where the optimization over the transmit covariance is 
either used or not in both cases simultaneously. Using the compound outage probability results 
obtained above, the compound outage capacity can be characterized more precisely. 

Proposition 10: The compound outage capacity in the low-uncertainty regime d e < 1 for 
both uncertainty sets, D(f\\f ) < d and D(f \\f) < d, equals the nominal outage capacity, 

C 5 w C os (39) 
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Proof: Follows directly from ([191 ) and (l36t . ■ 
Proposition 11: The compound outage capacity for the D(f \\f) < d uncertainty set satisfies 
the following inequality, 



C s = V5 < 1 - e~ d (40) 

In particular, no transmission is possible at S < d: C$ = V5 < d. 

Proof: Follows from (|24l and from 1 — e~ d < d. ■ 
Proposition 12: The compound outage capacity for the D(f \\f) < d uncertainty set in the 
d <C 1 regime satisfies the following inequality, 

Cs < Co((J-d) 

i.e. d serves as a measure of loss in the target outage probability due to the channel distribution 
uncertainty. 

Proof: Follows from (ED and 1 - e~ d + e~ d 5 w d + 5 for d < 1. ■ 
When optimization over the transmit covariance (distribution) is done, the results above also 
apply with the substitution e e* and Cos meaning the optimized nominal outage capacity. 

VI. Compound Outage Probability via L p Distance Constraint 

Let us now consider the channel distribution uncertainty class of the form ||/ — / || p < d, 
where ||/|| p = (/ \f\ p dH)^ p is L p norm, p > 1. It can be characterized as follows. 

Proposition 13: The compound outage probability for the ||/ — / || p < d uncertainty class, 
p > 1, can be bounded as follows: 

P p = max Pr{/(x;y|H) < R} (41) 

\\f-fo\\p<d 

> Pi (42) 
>m a x[PUd 2 /2),PUd 2 /2)] (43) 

> i _ e -d 2 /2 + e -dV2 £ (44) 

> 1 - e- d2/2 (45) 

where Pl ut (d) and P^id) are the compound outage probabilities as functions of distance d for 
the D(f\\f ) < d and D(f \\f) < d classes. 
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Proof: (1421) follows from the norm inequality ||/ — / || p < ||/ — /o||i- (1431 ) follows from 
Pinsker inequality D(f\\f ) > |||/ - / ||? [27] and the fact that ||/ - / ||i = ||/o - ® 
and d45]) follow from (l24l ■ 
Note that there is an error floor effect here as well: P p (e = 0) > 1 — t'^l 2 > Vd > 0. 
The p = 2 case corresponds to the widely-used mean square error criterion (including channel 
estimation and measurements), so that there is an error floor for the MSE uncertainty class as 
well. Finally, when the compound outage probability is minimized over the transmit covariance 

1(2) 

(distribution), the same inequalities hold (where P a ^ t and e are also minimized), and the error 
floor is not affected by this. When d is small (so that the deviation of / from f is also small), 
the L p and relative entropy distances can be shown to have the same order of magnitude, so that 
the corresponding compound outage probabilities will scale similarly. 

VII. Conclusion 

Compound outage probability and capacity of a class of fading MIMO channels with partial 
channel distribution information have been introduced and studied. These concepts generalize 
well-known and widely used concepts of outage probability and capacity of fading channels with 
completely known distribution to the case where only partial knowledge of distribution is avail- 
able. Relative entropy distance is used as a measure of uncertainty, which is shown to be related 
directly to the compound outage probability. Since relative entropy distance is not symmetric, 
two uncertainty classes are considered and worst-case fading distributions are identified for both. 
A number of properties, bounds and approximations of the compound outage probability are 
given. The transmit covariance matrix optimized for the nominal outage probability is shown to 
be also optimal for the compound one. The nominal fading distribution enters into the compound 
outage probability only via the nominal outage probability, all other details being irrelevant, i.e. 
two different nominal distributions having the same nominal outage will also produce the same 
compound outage probability. Behavior of the compound outage probability reveals two distinct 
regimes: uncertainty-dominated regime and nominal-outage dominated one. While increasing 
the average SNR or optimizing the transmit covariance to reduce the outage probability is 
effective for the latter, it has only negligible effect in the former, which immediately suggest 
a design alternative (via reducing uncertainty rather than increasing the SNR or optimizing the 
transmit covariance). All these results are very general as they hold for arbitrary nominal fading 
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distribution and also for arbitrary noise (i.e. not only Gaussian). 
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IX. Appendix 

A. Proof of Theorem [7] 

The problem in © can be presented in this form lt- 

max J £(H.)fdH, s.t. J flnjrdH < d, J fdH = 1. (46) 

The problem is clearly convex (since the objective is linear and the constraint set is convex). 
Furthermore, strong duality holds (i.e. the duality gap is zero), since Slater condition (see e.g. 
11231 ) is satisfied. Therefore, the KKT conditions are sufficient for optimality [23]. The Lagrangian 
of this problem is 

L = J £{H)fdH - s (^j / In jdH - d^j - // (j fdH - 1^ , (47) 

and, taking the variational derivative (see e.g. If24l ) of L with respect to /, one obtains the KKT 
conditions: 

£(H) - s ( In i- + 1 J - fi = (48) 



fo 

fdH -1=0 (49) 



/ In ^-dH -d) =0 (50) 



s>0 (51) 



fo 



It is straightforward to see that the complementary slackness condition (1501) implies that the 
second term is zero, i.e. the optimum is achieved on the boundary. Using (1481 and (l49l . one 
obtains, after some manipulations, 

e m)/s* 

f* — /o^— — / tjt* — tt (52) 
1 + e(e L ' s — 1) 

9 An additional constraint / > is not included since our solution explicitly satisfies it. 
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where /* and s* are the solutions of (|48T)-(|5TT), from which ([TOl follows. To obtain ©, note 
that, due to zero duality gap, the solution to (|46T ) equals to that of the dual problem, 

P oui (R x ) = mm L{s) (53) 

s>0 

where L(s) is the dual function, 

L(s) = max{L} s.t. (|49T) 

= s\n(l + (e 1/s -l)e) + sd (54) 

and n was eliminated using (|49| ). Since L(s) is the dual function, it is convex. Furthermore, 
when < e < 1, 

<i 2 L(s) _ e(l - £)eV s 

- S 2(! + ( e i/ a _ 1)£ )2 > 

for < s < oo, i.e. strictly convex, so that the problem in (1531 has a unique solution. Since 
Pout = £ when e = 1,0 (see Proposition O, the problem in (|53l) has always a unique solution, 
which can be found from dL(s)/ds = using any known numerical algorithm. The tools of 
asymptotic analysis (e.g. E5lll26l ) allow one to obtain a number of approximations. 

B. Proof of Proposition [7] 

1) To prove item 1, note that P out in © is a point- wise minimum of a set of affine functions 
of d (indexed by s) and therefore is concave (see e.g. [23 j). 

2) Using Lyapounov inequality, 

(E\x\ p ) 1/p < (E\x\ q ) 1/q 

where < p < q and E denotes expectation, for random variable x = and p — 1/si, q — 
1/^2, < S2 < si, one obtains 

( Ee m)/siy> < ( Ee ^ H )/^) S2 

and taking log, 

Sl lnEe^/ Sl <s 2 lnEe i ^ S2 
so that F(s) = slnEe £ ( H )/ s = sln(l + (e 1//s — l)e) is a non-increasing function of s. Therefore, 

P out (d = 0) = minF(s) = lim F(s) = e (55) 

s>0 s— >oo 
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3) Define L<(s) = sln(l + (e 1/s - + s^, i = 1,2, d x < d 2 . Then, Li(s) < L 2 (s) with 
equality iff s = 0, in which case Lx(s) = L 2 (s) = 1 or (if e > or = 0). Taking min s > of 
both sides, one obtains P out {di) < P ou t(d 2 ) with equality iff P out {di) = P out {d 2 ) = 1,0. 

C. Proof of Proposition [2] 

1) Using (E2>, 

P °«* - i + e(e i/ a * _ i) (56) 
from which it follows that P out — 1 iff £ = 1. 

2) Item 2 is proved in the same way as item 1 above. 

3) The inequality is due to that fact that the distribution uncertainty set always includes 
the nominal distribution. The "if part follows from items 1 and 2 above and from item 2 of 
Proposition [TJ The "only if part is verified using (156T ). 

D. Proof of Proposition \3\ 

It mimics the proof of item 3 in Proposition [TJ Define Lj(s) = sln(l + (e 1 ^ — l)si) + sd, 
i = 1,2, Ei < e 2 . Then, Li(s) < L 2 (s) with equality iff s = 0, in which case Li(s) = 
L 2 (s) = 1. Taking min s > of both sides, one obtains P ou t{si) < Pmit{ £ 2) with equality iff 
Pout(^i) = Pout{£2) = 1, which is possible only if £i = e 2 — 1. The concavity of P ut{^) 
follows from the fact that it is a point-wise minimum of a set of concave functions of e (indexed 
by s), see (l53l) . 

E. Proof of Proposition 

The lower bound was proved above. The upper bound follows from the following: 

P out = min[sm(l + (e 1/s - l)e) + sd] 

s>0 

< ln(l + (e - l)e) + d 
<{e-l)e + d 

where 1st inequality is obtained by setting s = 1 and 2nd one follows from ln(l + x) < x for 
x > 0. 
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F. Proof of Proposition \5\ 

Since L(s) in (fl"2l) is convex, a unique minimum in © can be found by setting the derivative 
to zero, dL(s)/ds = 0, which can be expressed as 

d + ln(l + 2/) = f±^ln(l + ^) (57) 



1 + y V e 

where y = e{e 1 ^ s — 1). It is straightforward to see that y — >■ as e — > so that (1571) can be 
transformed to 

where we have used ln(l + ?/)=?/ + o(y) and 1/(1 +y) = 1 — y + o(y). Using the techniques 
of asymptotic analysis (see e.g. Il25lll26l ). the following equation 

x In x = u (59) 

has the solution when u — > oo, 

lnw — lnlnw ( (mu/J ^ ^ 

Setting a; = 1+y/e = e 1 ^, u = (l+o(l))d/e, one obtains, after some lengthy but straightforward 
manipulations, the solution y* of (1581) and corresponding s*: 

d 

ln^^Tnb 

e 

1 



In a - In In a 

e e 

so that, after some further manipulations, 



;i+o(i)) 



P«t = L(0 = ln ,_ rf lnln , (l + o(l)) (61) 

e e 

G. Proof of Proposition \6\ 

In the d — > and fixed £ regime, the minimizer s* in ([8]) is s* — >• oo (as follows from (l55t). 
so that y* je = e l l s * — 1 — v and (1571 ) can be transformed to 

i(!)V £)+0(y2) rf 

from which one obtains 



Ifd 

1 • oil)) 



1-e 
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so that 

'E(l-e) 



2d a + o(D) 

and 

P out = e+ ^2d{l - e)e + o{Vd) (62) 

Further analysis shows that the approximation above (without o(\fd) term) is accurate when 

d < £ < 1. 

//. Proof of Theorem \3\ 

This proof follows along the same lines as that of Theorem \T\ which is summarized below. 
The optimization problem is: 

max J £(U)fdU, s.t. J f lujdU < d, J fdH = 1. (63) 

The problem is also convex and strong duality holds, so that the KKT conditions are sufficient 
for optimality. The Lagrangian is 

L = J £(H)/dH - A (J /olnydH -d^j-^^j fdH - lj , (64) 



and the KKT conditions are: 



;H) + - fi = (65) 
fdH -1 = (66) 
A I / f In fydH - d J = (67) 



/ 

A > (68) 



After some manipulations, one obtains 



/* = A *f° (69) 



fji-e(H 

where /* and A* are the solutions of (|65T)-(|68T). A* is found from (167) 



A* = e" d (/i - l) £ ^' £ (70) 
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and ji is found from (|66l ) 



/i — 1 /i 

which can be transformed, after some manipulations, to (|29l ). 

The alternative characterization of P out (Rx) follows from the dual problem, 



P out (R x ) =minL(A) (72) 

A>0 



where L(A) is the dual function, 



L(A) = min{L} s.t. ([66]) 

= Ae + In + A(d-lnf) (73) 

— 1 /U — 1 / V A/ 



(74) 



and /i is found from (1711 ), 

\i = ~ (l + A + V / (1-A) 2 + 4A £ ) . 



/. Proof of Proposition \8\ 

It is straightforward to see that the left-hand side of (1291 is a strictly increasing function of 
fi, V/i > 0, G (0, 1), which takes values in [0, 1], so that a solution is unique. It also follows 
that jj, — > 1 as e — > so that expanding fi(e) in Taylor series, jj, — 1 + ai£ + o(£:), one obtains 
from (1291) 

In — — + oil) = -d (75) 
1 + «i 

where we have used In /i = c^e + o(e), from which it follows that 



= 1 + + o(e) (76) 

e a — 1 



and also A = e d (l + o(l)), so that 

A 



out = -e = l-e~ d + o(l) (77) 



/i — 1 

(1761 ) suggests that the approximation in (1771 ) (without o(l) term) is accurate when e < e rf - 1, 
which is also confirmed by numerical analysis. 
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J. Proof of Proposition \9\ 



It follows from (l29l that \i — > oo as d — > so we use the expansion 



i 1 /i) fi + 2fi 2 + ° (^ 2 ) 



(78) 



to transform (1291 to 




^yS (1 + o(1)) 




(79) 



(80) 



and also that A* = + o(l)). Using these, one finally obtains 



Pout = -e = £ + y/2e(l-e)d + o(y/d) 



l» 



(81) 
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